p 

!U 
ITJ 



Script 



Input Data 



Analyze Script 



3 

Parallel 
Computation 



4 

Script 
Fragments 



X 



Runtime System 



Original Tool 



Supplemental 
Programs 



Output Data 



y 

'0 

*0 



FIG. 1. Parallelization Method 



• 



m 



sirs 



I 



2 Analyze Script 



Initial Script 



f 



101 * 



Divide into Statements 



Sequence of Statements 



Construct Serial 
Dataflow Graph 



Serial Dataflow Graph 



Parallelize 



106 



Parallel Dataflow Graph 



Generate Script Fragments 



V 



108 



Produce Parallel Specification 



Specification of Parallel 
Computation 



Script 
Fragments 



FIG. 2. Method for Analyzing Scripts 



(202 



indatal 



indata2 



203 

Step 1: CONCATENATE 




222 


r 204 

tempi 




223 


205 

Step 2: AGGREGATE 




224 


r 206 

temp2 




225 


207 

Step 3: ANALYZE 




226 


^08 ^ 

outdata 




j 



FIG. 3. Serial dataflow graph 




5=5 H 



203 




Step 1: CONCATENATE 




222 


i 




204 




tempi 








231 




r 


214 




Step 2a: Hash Partition 


By v2, v3 


232 




r 


'215 




temp5 







By v2, v3 



By v2, v3 



201 


\ 

indatal 

j 


* 


202 


indata2 




227 






229 


210 


Step la: Partition 




211 


Step 1 b: Partition 




228 

r 






230 

r 


212 


temp3 




213 


temp4 






v° 




/ 


^221 



223 



FIG. 4. Parallel Dataflow Graph 



205 




Step 2: AGGREGATE 


By v2, v3 


224 


i 




206 




temp2 






By v2, v3 


^233 


216 




Step 3a: Gather 




234 


f217 


t — - .. ^ 


temp6 




225 


207 




Step 3: ANALYZE 




226 


208 




outdata 







By v2, v3 



By v2, v3 



jn£> 



INPUT indatal 


inputl . 


dat 


INPUT indata2 


input2 . 


dat 


CONCATENATE i 


ndatal indata2 


AGGREGATE ol 


= MIN vl 


, o2 = MAX vl, o3 = SUM vl BY v2 , v3 


ANALYZE 






OUTPUT outdat 


a output 


.dat 



Script _ 



Jindatal: INPUT indatal inputl.dat 



Jindata2: INPUT indata2 input2.dat 



Step I: CONCATENATE indatal indata2 



Step2: AGGREGATE ... BY v2, v3 



Step3: ANALYZE 



H 201 
1 202 



Joutdata: OUTPUT outdata outputdat 



203 
205 
207 
208 



m 

i W 

m 
*0 



FIG. 5. Original Script and division into processing steps 





Name 


Class 


File 


Explicit? 


201 


indatal 


INPUT 


inputl.dat 


Yes 


202 


indata2 


INPUT 


input2.dat 


Yes 


208 


outdata 


OUTPUT 


output.dat 


Yes 


204 


temp 1 


TEMP 




No 


206 


temp2 


TEMP 




No 








Data Set Table 
















Name 


Operation 


Parameteres (excepting data sets) 




203 


Stepl 


CONCATENATE 






205 


Step2 


AGGREGATE 


o 1 = min v 1 , o2 = max v 1 , o3 - avg v 1 










BY v2, v3 




207 


Step3 


ANALYZE 












Processing Step Table 





601 



602 



111 

lit 2 



m 





Step 


Data Set 


Direction 


Role 


220 


Stepl 


indatal 


input 


in 


221 


Stepl 


indata2 


input 


in 


222 


Stepl 


tempi 


output 


out 


223 


Step2 


tempi 


input 


in 


224 


Step2 


temp2 


output 


out 


225 


Step3 


temp2 


input 


in 


226 


Step3 


outdata 


output 


out 



Data Set Access Table 



603 



£ c 



FIG. 6. Serial Dataset, Processing Step, and Dataset Access Tables 



ill 




Start 



500 

Initialize Data Set Table (601) 
Initialize Processing Step Table (602) 
Initialize Data Access Table (603) 



501 

Find all data sets & global 
variables and create entries in 
Data Set Table (601) 



-ssr 

iU 

::; 
: -~ 

*. -TH 



: a 




512 






Done 



503 

Select next 
Processi 


unanalyzed 
ng Step 




r 


504 

Create Entry in Process 
Step Table (602) 






505 

For each explici 
create entry in D 
(6C 


t I/O reference, 
ata Access Table 
3) 




Yes 



507 

Determine implicit data 
set based on context 



Create entry in 
Data Access Table (603) 



Yes 



510 



Create Intermediate 

Data Set entry in 
Data Set Table (601) 



511 



Create entry in 
Data Access Table (603) 



FIG. 7. Method for converting sequences of steps to serial dataflow graph 




/ v : \ 



903 


CONCAT or COPY processing 
step 


Parallelize by simple partitioning 


904 


Processing step contains BYclause 


Parallelize by key-based partitioning 


905 


INPUT, OUTPUT, or TEMP data 
set accessing partitioned file 


Parallelize by expansion of the partitioned file. 


906 


AGGREGATE operation 


Parallelize by splitting into global and local stages. 


907 


INDB data set accessing 
partitioned database table. 


Parallelize by adding an external "db unload" process. 


908 


OUT DB data set accessing 
partitioned database table 


Parallelize by adding an external "db load" process. 



□ 

if* 

FIG. 8. Parallelization repertoire 



ru 

i 

|0 

ii 

n 

r — 

i 

w 



Start 



700 

Initialize Parallel Data Set Table (801) 
Initialize Parallel Processing Step Table (802) 
Initialize Parallel Data Access Table (803) 



111 

® 



P 
ins? 






707 

Resolve Partitioning 
Conflicts 






708 
Done 









Yes 



702 

Select an unprocessed 
Step from the processing 
step table (802) 



703 

Consult parallelization repertoire 
(900) to locate best applicable 
parallelization technique 



Yes 




705 

Apply rewrite rule (902) 



706 

Mark vertex as "serial" 



FIG. 9. Method for Parallelizing serial dataflow graph 




t 





Name 


Class 


File 


Explicit? 


Partitioning 


201 


indatal 


INPUT 


inputl.dat 


Yes 


Serial 


202 


indata2 


INPUT 


input2.dat 


Yes 


Serial 


208 


outdata 


OUTPUT 


output.dat 


Yes 


Serial 


204 


tempi 


TEMP 




No 




206 


temp2 


TEMP 




No 










Parallel Data Set Table (initial) 





801 





Name 


Operation Parameters (excepting data sets) 


Partitioning 


203 


Stepl 


CONCATENATE 






205 


Step2 


AGGREGATE ol=minvl,o2 


= max v 1 , o3 = avg v 1 








BY v2, v3 






207 


Step3 


ANALYZE 










Parallel Processing Step Table (initial) 
















Step 


Data Set Direction 


Role 


Partitioning 


220 


Stepl 


indatal input 


in 




221 


Stepl 


indata2 input 


in 




222 


Stepl 


tempi output 


out 




223 


Step2 


tempi input 


in 




224 


Step2 


temp2 output 


out 




225 


Step3 


temp2 input 


in 




226 


Step3 


outdata output 


out 








Parallel Data Set Access Table (initial) 





802 



803 
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FIG. 11. Parallel Dataset, Processing Step, and Dataset Access Tables, after parallelizing processing steps. 
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FIG. 13. Parallel Dataset, Processing Step, and Dataset Access Tables, after resolving Step 1 
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FIG. 14. Parallel Dataset, Processing Step, and Dataset Access Tables, after resolving Step 2 
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FIG. 15. Parallel Dataset, Processing Step, and Dataset Access Tables, after resolving Step 3 
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FIG. 18. Parallel Dataset, Processing Step, and Dataset Access Tables, after generation of script fragments 
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FIG. 20. Parallel Dataset, Processing Step, and Dataset Access Tables, after defining temporary datasets 
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FIG. 21. Specification of parallel computation 
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FIG. 23. Key-based Partitioning 
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FIG. 25. Local-Global Division Parallelization 
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FIG. 27. Statement Decomposition 
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FIG. 29. Local-Global Division for SAS FREQ process 
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